Accelerate Creation of Product Claims Using Generative AI

Liang, Po-Yu, Zhang, Yong, Hwa, Tatiana, Byers, Aaron

arXiv.org Artificial Intelligence

A product's benefit claims are a critical driver of consumers' purchase behavior. Creating product claims is an intensive task that requires substantial time and funding. We have developed the $\textbf{Claim Advisor}$ web application to accelerate claim creation using in-context learning and fine-tuning of large language models (LLMs). $\textbf{Claim Advisor}$ was designed to disrupt the speed and economics of claim search, generation, optimization, and simulation. It has three functions: (1) semantically searching and identifying existing claims and/or visuals that resonate with the voice of consumers; (2) generating and/or optimizing claims based on a product description and a consumer profile; and (3) ranking generated and/or manually created claims using simulations via synthetic consumers. Applications in a consumer packaged goods (CPG) company have shown very promising results. We believe that this capability is broadly useful and applicable across product categories and industries. We share our learnings to encourage the research and application of generative AI in different industries.
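The abstract's first function, semantic claim search, can be sketched as cosine-similarity ranking over claim embeddings. This is a minimal illustration, not the Claim Advisor implementation: the embeddings, claim texts, and the `top_k_claims` helper are all hypothetical stand-ins for a real sentence encoder and claim database.

```python
import numpy as np

def top_k_claims(query_vec, claim_vecs, k=2):
    """Rank stored claim embeddings by cosine similarity to a query embedding."""
    q = query_vec / np.linalg.norm(query_vec)
    C = claim_vecs / np.linalg.norm(claim_vecs, axis=1, keepdims=True)
    sims = C @ q                       # cosine similarity per claim
    order = np.argsort(-sims)          # best match first
    return order[:k], sims[order[:k]]

# Toy 4-d "embeddings" standing in for real sentence-encoder output.
claims = np.array([
    [0.90, 0.10, 0.0, 0.0],   # e.g. "gentle on skin"
    [0.00, 0.80, 0.2, 0.0],   # e.g. "long-lasting freshness"
    [0.85, 0.20, 0.1, 0.0],   # e.g. "dermatologist tested"
])
query = np.array([1.0, 0.1, 0.0, 0.0])  # voice of consumer: skin gentleness
idx, scores = top_k_claims(query, claims, k=2)
```

In a real deployment the query vector would come from embedding consumer-research text with the same encoder used for the stored claims.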


Doubly Robust Fusion of Many Treatments for Policy Learning

Zhu, Ke, Chu, Jianing, Lipkovich, Ilya, Ye, Wenyu, Yang, Shu

arXiv.org Machine Learning

Individualized treatment rules/recommendations (ITRs) aim to improve patient outcomes by tailoring treatments to the characteristics of each individual. However, when there are many treatment groups, existing methods face significant challenges due to data sparsity within treatment groups and highly unbalanced covariate distributions across groups. To address these challenges, we propose a novel calibration-weighted treatment fusion procedure that robustly balances covariates across treatment groups and fuses similar treatments using a penalized working model. The fusion procedure ensures the recovery of latent treatment group structures when either the calibration model or the outcome model is correctly specified. In the fused treatment space, practitioners can seamlessly apply state-of-the-art ITR learning methods with the flexibility to utilize a subset of covariates, thereby achieving robustness while addressing practical concerns such as fairness. We establish theoretical guarantees, including consistency, the oracle property of treatment fusion, and regret bounds when integrated with multi-armed ITR learning methods such as policy trees. Simulation studies show superior group recovery and policy value compared to existing approaches. We illustrate the practical utility of our method using a nationwide electronic health record-derived de-identified database containing data from patients with Chronic Lymphocytic Leukemia and Small Lymphocytic Lymphoma.
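The calibration-weighting idea behind the fusion procedure can be sketched with exponential tilting (entropy balancing), one common calibration scheme: solve for weights within a treatment group so its weighted covariate mean matches a pooled target. This is a simplified stand-in, assuming linear moment constraints; the paper's exact calibration model may differ.

```python
import numpy as np

def calibration_weights(X, target, n_iter=50):
    """Exponential-tilting weights w_i proportional to exp(lam @ x_i) whose
    weighted covariate mean matches `target` (one calibration-weighting scheme).
    Solved by Newton's method on the dual problem."""
    lam = np.zeros(X.shape[1])
    for _ in range(n_iter):
        w = np.exp(X @ lam)
        w /= w.sum()
        mean = w @ X                        # current weighted covariate mean
        grad = mean - target                # dual gradient
        Xc = X - mean
        H = (w[:, None] * Xc).T @ Xc        # weighted covariance (dual Hessian)
        lam -= np.linalg.solve(H + 1e-8 * np.eye(len(lam)), grad)
    w = np.exp(X @ lam)
    return w / w.sum()

rng = np.random.default_rng(0)
X1 = rng.normal(0.5, 1.0, size=(200, 2))  # covariates in one treatment group
overall = np.zeros(2)                     # target: pooled covariate mean
w = calibration_weights(X1, overall)
balanced_mean = w @ X1                    # matches the target after reweighting
```

With many treatment groups, weights like these would be computed per group before fitting the penalized working model that fuses similar treatments.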


Automated Measurement of Eczema Severity with Self-Supervised Learning

Kumar, Neelesh, Aran, Oya

arXiv.org Artificial Intelligence

Automated diagnosis of eczema from images acquired with a digital camera can enable individuals to self-monitor their recovery. The process entails first segmenting the eczema region from the image and then measuring the severity of eczema in the segmented region. The state-of-the-art methods for automated eczema diagnosis rely on deep neural networks such as convolutional neural networks (CNNs) and have shown impressive performance in accurately measuring the severity of eczema. However, these methods require massive volumes of annotated data for training, which can be hard to obtain. In this paper, we propose a self-supervised learning framework for automated eczema diagnosis under a limited training data regime. Our framework consists of two stages: i) segmentation, where we use an in-context-learning-based algorithm called SegGPT for few-shot segmentation of the eczema region from the image; ii) feature extraction and classification, where we extract DINO features from the segmented regions and feed them to a multi-layer perceptron (MLP) for 4-class classification of eczema severity. When evaluated on a dataset of annotated "in-the-wild" eczema images, we show that our method (Weighted F1: 0.67 $\pm$ 0.01) outperforms state-of-the-art deep learning methods such as a fine-tuned ResNet-18 (Weighted F1: 0.44 $\pm$ 0.16) and a Vision Transformer (Weighted F1: 0.40 $\pm$ 0.22). Our results show that self-supervised learning can be a viable solution for automated skin diagnosis where labeled data is scarce.
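Stage (ii) of the framework, a small MLP classifying frozen features into four severity levels, can be sketched as below. The synthetic Gaussian blobs stand in for real DINO feature vectors, and the training loop is a generic softmax cross-entropy MLP, not the authors' exact architecture or hyperparameters.

```python
import numpy as np

rng = np.random.default_rng(0)
n_per_class, dim, n_classes = 50, 16, 4   # 4 eczema severity levels

# Synthetic stand-ins for DINO features: one Gaussian blob per severity class.
centers = rng.normal(0, 3, size=(n_classes, dim))
X = np.vstack([centers[c] + rng.normal(0, 1, (n_per_class, dim))
               for c in range(n_classes)])
y = np.repeat(np.arange(n_classes), n_per_class)
Y = np.eye(n_classes)[y]                  # one-hot labels

# One-hidden-layer MLP trained with full-batch gradient descent.
W1 = rng.normal(0, 0.1, (dim, 32)); b1 = np.zeros(32)
W2 = rng.normal(0, 0.1, (32, n_classes)); b2 = np.zeros(n_classes)
lr = 0.1
for _ in range(500):
    H = np.maximum(0, X @ W1 + b1)               # ReLU hidden layer
    logits = H @ W2 + b2
    P = np.exp(logits - logits.max(1, keepdims=True))
    P /= P.sum(1, keepdims=True)                 # softmax probabilities
    G = (P - Y) / len(X)                         # cross-entropy gradient
    dW2, db2 = H.T @ G, G.sum(0)
    dH = (G @ W2.T) * (H > 0)                    # backprop through ReLU
    dW1, db1 = X.T @ dH, dH.sum(0)
    W1 -= lr * dW1; b1 -= lr * db1
    W2 -= lr * dW2; b2 -= lr * db2

acc = (P.argmax(1) == y).mean()                  # training accuracy
```

The point of the two-stage design is that only this small classifier is trained; the feature extractor stays frozen, which is what makes the limited-data regime workable.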


Double Machine Learning meets Panel Data -- Promises, Pitfalls, and Potential Solutions

Fuhr, Jonathan, Papies, Dominik

arXiv.org Machine Learning

Estimating causal effects using machine learning (ML) algorithms can help to relax functional form assumptions if used within appropriate frameworks. However, most of these frameworks assume settings with cross-sectional data, whereas researchers often have access to panel data, which in traditional methods helps to deal with unobserved heterogeneity between units. In this paper, we explore how we can adapt double/debiased machine learning (DML) (Chernozhukov et al., 2018) for panel data in the presence of unobserved heterogeneity. This adaptation is challenging because DML's cross-fitting procedure assumes independent data and the unobserved heterogeneity is not necessarily additively separable in settings with nonlinear observed confounding. We assess the performance of several intuitively appealing estimators in a variety of simulations. While we find violations of the cross-fitting assumptions to be largely inconsequential for the accuracy of the effect estimates, many of the considered methods fail to adequately account for the presence of unobserved heterogeneity. However, we find that using predictive models based on the correlated random effects approach (Mundlak, 1978) within DML leads to accurate coefficient estimates across settings, given a sample size that is large relative to the number of observed confounders. We also show that the influence of the unobserved heterogeneity on the observed confounders plays a significant role in the performance of most alternative methods.
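The correlated-random-effects device the abstract highlights is mechanically simple: augment each observation's covariates with their within-unit means so that the ML nuisance models can absorb unit-level heterogeneity. A minimal sketch (the helper name is our own, not from the paper):

```python
import numpy as np

def add_mundlak_means(X, unit_ids):
    """Mundlak (1978) device: append within-unit covariate means as extra
    features, letting flexible nuisance models soak up unit heterogeneity."""
    X = np.asarray(X, dtype=float)
    means = np.zeros_like(X)
    for u in np.unique(unit_ids):
        mask = unit_ids == u
        means[mask] = X[mask].mean(axis=0)
    return np.hstack([X, means])

# Two units, three periods each, one covariate.
X = np.array([[1.0], [2.0], [3.0], [10.0], [11.0], [12.0]])
ids = np.array([0, 0, 0, 1, 1, 1])
X_cre = add_mundlak_means(X, ids)
# Appended column: 2.0 for every unit-0 row, 11.0 for every unit-1 row.
```

The augmented matrix would then feed the nuisance learners inside an otherwise standard DML procedure.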


Bounding Causal Effects with Leaky Instruments

Watson, David S., Penn, Jordan, Gunderson, Lee M., Bravo-Hermsdorff, Gecia, Mastouri, Afsaneh, Silva, Ricardo

arXiv.org Artificial Intelligence

Instrumental variables (IVs) are a popular and powerful tool for estimating causal effects in the presence of unobserved confounding. However, classical approaches rely on strong assumptions such as the $\textit{exclusion criterion}$, which states that instrumental effects must be entirely mediated by treatments. This assumption often fails in practice. When IV methods are improperly applied to data that do not meet the exclusion criterion, estimated causal effects may be badly biased. In this work, we propose a novel solution that provides $\textit{partial}$ identification in linear systems given a set of $\textit{leaky instruments}$, which are allowed to violate the exclusion criterion to some limited degree. We derive a convex optimization objective that provides provably sharp bounds on the average treatment effect under some common forms of information leakage, and implement inference procedures to quantify the uncertainty of resulting estimates. We demonstrate our method in a set of experiments with simulated data, where it performs favorably against the state of the art. An accompanying $\texttt{R}$ package, $\texttt{leakyIV}$, is available from $\texttt{CRAN}$.
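For context, the classical point-identified estimate that leaky-instrument bounds generalize is the just-identified linear IV (Wald) estimator. The sketch below shows only that baseline under a valid exclusion restriction; it is not the paper's convex-optimization bounding procedure, and the simulated coefficients are illustrative.

```python
import numpy as np

def iv_wald(z, x, y):
    """Just-identified linear IV (Wald) estimate: Cov(Z,Y) / Cov(Z,X)."""
    return np.cov(z, y)[0, 1] / np.cov(z, x)[0, 1]

rng = np.random.default_rng(0)
n = 20000
u = rng.normal(size=n)                 # unobserved confounder
z = rng.normal(size=n)                 # instrument (exclusion holds here)
x = z + u + rng.normal(size=n)         # treatment
y = 2.0 * x + u + rng.normal(size=n)   # true causal effect = 2

beta_iv = iv_wald(z, x, y)                          # close to 2 despite confounding
beta_ols = np.cov(x, y)[0, 1] / np.var(x, ddof=1)   # biased upward by U
```

When Z also affects Y directly (the leak), this single number is no longer identified, which is exactly where interval estimates such as those from the `leakyIV` package take over.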


Estimating Causal Effects with Double Machine Learning -- A Method Evaluation

Fuhr, Jonathan, Berens, Philipp, Papies, Dominik

arXiv.org Machine Learning

The estimation of causal effects with observational data continues to be a very active research area. In recent years, researchers have developed new frameworks which use machine learning to relax classical assumptions necessary for the estimation of causal effects. In this paper, we review one of the most prominent methods - "double/debiased machine learning" (DML) - and empirically evaluate it by comparing its performance on simulated data relative to more traditional statistical methods, before applying it to real-world data. Our findings indicate that the application of a suitably flexible machine learning algorithm within DML improves the adjustment for various nonlinear confounding relationships. This advantage enables a departure from traditional functional form assumptions typically necessary in causal effect estimation. However, we demonstrate that the method continues to critically depend on standard assumptions about causal structure and identification. When estimating the effects of air pollution on housing prices in our application, we find that DML estimates are consistently larger than estimates of less flexible methods. From our overall results, we provide actionable recommendations for specific choices researchers must make when applying DML in practice.
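The DML recipe the abstract evaluates can be sketched for the partially linear model: fit nuisance regressions out-of-fold (cross-fitting), then regress outcome residuals on treatment residuals. Here plain OLS stands in for the flexible ML learners the method is designed around, so this is a structural sketch rather than a faithful replication.

```python
import numpy as np

def dml_plm(y, d, X, n_folds=2, seed=0):
    """Cross-fitted partialling-out for Y = theta*D + g(X) + e.
    Nuisances are fit on held-out folds; theta comes from the final
    residual-on-residual regression (OLS nuisances stand in for ML)."""
    rng = np.random.default_rng(seed)
    folds = np.array_split(rng.permutation(len(y)), n_folds)
    ry, rd = np.empty_like(y), np.empty_like(d)
    Xc = np.column_stack([np.ones(len(y)), X])     # add intercept
    for k in range(n_folds):
        test = folds[k]
        train = np.concatenate([folds[j] for j in range(n_folds) if j != k])
        for target, resid in ((y, ry), (d, rd)):
            coef, *_ = np.linalg.lstsq(Xc[train], target[train], rcond=None)
            resid[test] = target[test] - Xc[test] @ coef
    return (rd @ ry) / (rd @ rd)                   # estimate of theta

rng = np.random.default_rng(1)
n = 5000
X = rng.normal(size=(n, 3))
d = X @ np.array([1.0, -0.5, 0.2]) + rng.normal(size=n)
y = 1.5 * d + X @ np.array([0.7, 0.3, -0.4]) + rng.normal(size=n)
theta_hat = dml_plm(y, d, X)                       # recovers theta = 1.5
```

Swapping the `lstsq` calls for gradient boosting or random forests is what buys the robustness to nonlinear confounding the paper examines; the identification assumptions it stresses remain unchanged either way.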


Dynamic Contexts for Generating Suggestion Questions in RAG Based Conversational Systems

Tayal, Anuja, Tyagi, Aman

arXiv.org Artificial Intelligence

When interacting with Retrieval-Augmented Generation (RAG)-based conversational agents, users must carefully craft their queries to be understood correctly. Yet, understanding the system's capabilities can be challenging for users, leading to ambiguous questions that necessitate further clarification. This work aims to bridge the gap by developing a suggestion question generator. To generate suggestion questions, our approach utilizes dynamic context, which includes both dynamic few-shot examples and dynamically retrieved contexts. Through experiments, we show that the dynamic contexts approach generates better suggestion questions compared to other prompting approaches.
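The dynamic few-shot half of the idea can be sketched as: embed the user query, retrieve the nearest stored examples, and splice them into the prompt, so each query gets tailored demonstrations rather than a fixed set. The embeddings, example texts, and `build_prompt` helper below are illustrative assumptions, not the paper's pipeline.

```python
import numpy as np

def build_prompt(query_vec, query_text, examples, k=2):
    """Pick the k stored examples closest (cosine) to the query embedding
    and assemble a few-shot prompt for the suggestion-question generator."""
    vecs = np.array([v for v, _ in examples], dtype=float)
    vecs /= np.linalg.norm(vecs, axis=1, keepdims=True)
    q = query_vec / np.linalg.norm(query_vec)
    best = np.argsort(-(vecs @ q))[:k]            # most similar examples first
    shots = "\n".join(examples[i][1] for i in best)
    return f"{shots}\nUser: {query_text}\nSuggested follow-up:"

# Toy 2-d embeddings standing in for a real sentence encoder.
examples = [
    (np.array([1.0, 0.0]),
     "User: refund status?\nSuggested follow-up: How long do refunds take?"),
    (np.array([0.0, 1.0]),
     "User: reset password?\nSuggested follow-up: What if I lost my email?"),
    (np.array([0.9, 0.2]),
     "User: cancel order?\nSuggested follow-up: Can I cancel after shipping?"),
]
prompt = build_prompt(np.array([1.0, 0.1]), "Where is my refund?", examples, k=2)
```

The dynamically retrieved document contexts would be appended to the same prompt before it is sent to the LLM.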


Recent Advances in Graph-based Machine Learning for Applications in Smart Urban Transportation Systems

Wu, Hongde, Yan, Sen, Liu, Mingming

arXiv.org Artificial Intelligence

The Intelligent Transportation System (ITS) is an important part of modern transportation infrastructure, employing a combination of communication technology, information processing, and control systems to manage transportation networks. This integration of various components, such as roads, vehicles, and communication systems, is expected to improve efficiency and safety by providing better information, services, and coordination of transportation modes. In recent years, graph-based machine learning has become an increasingly important research focus in the field of ITS, aiming at the development of complex, data-driven solutions to address various ITS-related challenges. This chapter presents background information on the key technical challenges of ITS design, along with a review of research methods ranging from classic statistical approaches to modern machine learning and deep learning-based approaches. Specifically, we provide an in-depth review of graph-based machine learning methods, including basic concepts of graphs, graph data representation, graph neural network architectures, and their relation to ITS applications. Additionally, two case studies of graph-based ITS applications proposed in our recent work are presented in detail to demonstrate the potential of graph-based machine learning in the ITS domain.
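The core propagation step of the graph neural networks the chapter reviews can be sketched in a few lines (a Kipf-style graph convolution). The road-junction graph and feature values below are made up for illustration.

```python
import numpy as np

def gcn_layer(A, H, W):
    """One graph-convolution step: H' = ReLU(D^{-1/2} (A+I) D^{-1/2} H W).
    Each node's features are mixed with its neighbours' (e.g. traffic
    sensors at adjacent road junctions)."""
    A_hat = A + np.eye(A.shape[0])             # add self-loops
    d_inv_sqrt = np.diag(1.0 / np.sqrt(A_hat.sum(axis=1)))
    return np.maximum(0, d_inv_sqrt @ A_hat @ d_inv_sqrt @ H @ W)

# 4 junctions on a line: 0-1-2-3; 2 features per node (e.g. speed, flow).
A = np.array([[0, 1, 0, 0],
              [1, 0, 1, 0],
              [0, 1, 0, 1],
              [0, 0, 1, 0]], dtype=float)
H = np.array([[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [0.5, 0.5]])
W = np.eye(2)                                  # identity weights for illustration
H1 = gcn_layer(A, H, W)                        # junction 0 now reflects junction 1
```

Stacking such layers (with learned `W` matrices) lets information propagate over multi-hop neighbourhoods of the transportation graph, which is the basis of the traffic-prediction architectures discussed in the chapter.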


The effect of measurement error on clustering algorithms

Pankowska, Paulina, Oberski, Daniel L.

arXiv.org Machine Learning

Clustering comprises a popular set of techniques used to separate data into interesting groups for further analysis. Many data sources on which clustering is performed are well known to contain random and systematic measurement errors. Such errors may adversely affect clustering. While several techniques have been developed to deal with this problem, little is known about the effectiveness of these solutions. Moreover, no work to date has examined the effect of systematic errors on clustering solutions. In this paper, we perform a Monte Carlo study to investigate the sensitivity of two common clustering algorithms, GMMs with merging and DBSCAN, to random and systematic error. We find that measurement error is particularly problematic when it is systematic and when it affects all variables in the dataset. For the conditions considered here, we also find that the partition-based GMM with merged components is less sensitive to measurement error than the density-based DBSCAN procedure.
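The mechanism by which random measurement error hurts clustering can be sketched with a tiny Monte Carlo: zero-mean noise inflates within-cluster spread and so shrinks cluster separation. This uses a simple separation index on two Gaussian clusters, not the paper's GMM/DBSCAN design; the error magnitudes are illustrative assumptions.

```python
import numpy as np

def separation(X, labels):
    """Distance between the two cluster means, divided by the pooled
    within-cluster standard deviation (higher = easier to cluster)."""
    m0, m1 = X[labels == 0].mean(0), X[labels == 1].mean(0)
    pooled_sd = np.sqrt(np.mean([X[labels == c].var(0).mean() for c in (0, 1)]))
    return np.linalg.norm(m0 - m1) / pooled_sd

rng = np.random.default_rng(0)
n = 500
labels = np.repeat([0, 1], n)
X = np.vstack([rng.normal(0, 1, (n, 2)), rng.normal(3, 1, (n, 2))])

sep_clean = separation(X, labels)
# Random error: zero-mean noise inflates within-cluster variance.
sep_random = separation(X + rng.normal(0, 1, X.shape), labels)
# A single constant offset shifts every observation equally and leaves
# this index unchanged; the systematic errors studied in the paper are
# more structured than a uniform shift, which is why they can still harm
# clustering.
sep_shifted = separation(X + 0.5, labels)
```

Repeating the noisy draw many times and running a clustering algorithm on each replicate is the essence of the Monte Carlo design the paper uses.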


Niger drone video shows US forces fighting for their lives

FOX News

WASHINGTON – Dramatic new drone video of the Niger ambush that killed four American soldiers shows U.S. forces desperately trying to escape and fighting for their lives after friendly Nigerien forces mistook them for the enemy. It describes how the fleeing troops set up a quick defensive location on the edge of a swamp and -- thinking they were soon to die -- wrote messages home to their loved ones. The video, released by the Pentagon with explanatory narration, includes more than 10 minutes of drone footage, file tape and animation that wasn't made public last week when the military released a portion of the final report on the October attack. The video depicts for the first time the harrowing hours as troops held off their enemy and waited for rescue. There were 46 U.S. and Nigerien troops out on the initial mission in the West African nation, going after but failing to find a high-value militant, then collecting intelligence at a site where the insurgent had been.